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Coronavirus subgenomic mRNA (sgmRNA) synthesis occurs via a process of discontinuous transcription involving transcrip- 
tion regulatory sequences (TRSs) located in the 5’ leader sequence (TRS-L) and upstream of each structural and group-specific 
gene (TRS-B). Several gammacoronaviruses including infectious bronchitis virus (IBV) contain a putative open reading frame 
(ORF), localized between the M gene and gene 5, which is controversial due to the perceived absence of a TRS. We have studied 
the transcription of a novel sgmRNA associated with this potential ORF and found it to be transcribed via a previously unidenti- 
fied noncanonical TRS-B. Using an IBV reverse genetics system, we demonstrated that the template-switching event during in- 
tergenic region (IR) sgmRNA synthesis occurs at the 5’ end of the noncanonical TRS-B and recombines between nucleotides 5 
and 6 of the 8-nucleotide consensus TRS-L. Introduction of a complete TRS-B showed that higher transcription levels are 
achieved by increasing the number of nucleotide matches between TRS-L and TRS-B. Translation of a protein from the sgmRNA 
was demonstrated using enhanced green fluorescent protein, suggesting the translation of a fifth, novel, group-specific protein 
for IBV. This study has resolved an issue concerning the number of ORFs expressed by members of the Gammacoronavirus ge- 
nus and proposes the existence of a fifth IBV accessory protein. We confirmed previous reports that coronaviruses can produce 
sgmRNAs from noncanonical TRS-Bs, which may expand their repertoire of proteins. We also demonstrated that noncanonical 
TRS-Bs may provide a mechanism by which coronaviruses can control protein expression levels by reducing sgmRNA synthesis. 


he Gammacoronavirus infectious bronchitis virus (IBV) is an 

enveloped positive-sense, single-stranded RNA virus that is 
the etiological agent of the acute highly contagious poultry disease 
infectious bronchitis (IB) (1-4). Infectious bronchitis virus is a 
highly infectious pathogen of domestic fowl that replicates pri- 
marily in epithelial cells of the respiratory tract causing IB and is 
responsible for major economic losses to poultry industries 
worldwide as a result of poor weight gain and decreased egg pro- 
duction (5, 6). In addition, some isolates have been found to be 
associated with renal disease and can be highly nephropathogenic 
(7-9). 

The IBV genome is typical of other coronaviruses with gene 1, 
the replicase gene, located at the 5’ end of the genome and the 
structural and group-specific accessory genes clustered at the 3’ 
end. Additionally, for IBV and the closely related gammacorona- 
virus turkey coronavirus (TCoV), there is a region located be- 
tween the membrane (M) gene and the group-specific gene 5 re- 
ferred to as the intergenic region (IR), also known as open reading 
frame (ORF) 4b or ORF X (10-13). With the exception of the 
laboratory-adapted attenuated IBV Beaudette strain and some 
IBV vaccine isolates which contain deletions in this region, the IR 
contains a putative ORF with the potential to code for a protein of 
94 amino acids with a predicted molecular mass of 11 kDa. For 
both IBV and TCoV there has been speculation over the function 
of the IR-associated ORF due to the lack of identification of an 
associated transcription regulatory sequence (TRS) for the gener- 
ation of a subgenomic mRNA (sgmRNA) for expression of the 
11-kDa protein. 

The model of coronavirus transcription proposed by Sawicki 
and Sawicki (14) has led to the general acceptance that transcrip- 
tion of the structural and group-specific genes of coronaviruses 
occurs via a process of discontinuous transcription during nega- 
tive-strand synthesis (reviewed in references 15 to 17). A con- 
served sequence known as the TRS is located at the distal end of 
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the leader sequence (TRS-L) present at the very 5’ end of a coro- 
navirus genome and upstream of each of the structural or group- 
specific genes (TRS-B). During synthesis of the sgmRNAs, the 
TRS-B acts a signal for pausing the replication transcription com- 
plex. The TRS-B of the nascent negative-strand sgRNA is then able 
to complementarily base pair with the TRS-L of the genome, fa- 
cilitating a template switch, and transcription continues to the 5’ 
end of the genome. The negative-sense sgRNAs, with coterminal 
5’ and 3’ ends, are then transcribed into a nested set of positive- 
sense sgmRNAs from which generally the 5'-most ORF is trans- 
lated. Evidence for the model of discontinuous transcription 
during negative-strand synthesis came, in part, from evidence 
suggesting that the TRS of each sgmRNA was derived from the 
TRS-B and not the TRS-L (18-20). The precise mechanisms of 
sgRNA synthesis are, as yet, not fully understood although a num- 
ber of sequence elements, including the 5’ and 3’ flanking nucleo- 
tides of the TRS, have been identified that may have important 
roles (18, 20-26). 

Identification of TRSs for IBV and TCoV strains is based on the 
proposed consensus sequence CUUAACAA although some vari- 
ation in this sequence is seen; for example, the TRS-B of the IBV 
spike (S) and gene 3 is CUGAACAA. A canonical TRS closely 
matching this consensus sequence has not been identified up- 
stream of the IBV or TCoV IR, resulting in the belief that no 
sgmRNA is transcribed and that this region is therefore likely to be 
a pseudogene. Northern blot analysis of the IBV mRNA profile 
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variably identifies the presence of a low-abundance RNA located 
between the sgmRNAs representing the M gene and gene 5. De- 
spite the correlation of the presence of this RNA with the location 
of the IR, this RNA was originally classified as an unknown species 
(27) and has been rarely mentioned in the literature since. 

In this study, we used an IBV reverse genetics system (28-30) to 
produce a number of recombinant IBVs (rIBVs) to investigate the 
IR and confirmed the nature of the low-abundance RNA as an IBV 
sgmRNA from which an IR-associated ORF could be expressed. 
This study details the use of a noncanonical TRS-B in transcrip- 
tion of this ssmRNA and proposes the presence of a fifth novel 
group-specific protein for IBV and other closely related gamma- 
coronaviruses. Our study supports previous observations that 
coronaviruses are able to control the levels of sgmRNA expression. 
These findings offer the possibility that coronaviruses may pro- 
duce other sgmRNAs utilizing similar noncanonical TRS-Bs, con- 
sequently increasing the repertoire of accessory genes and provid- 
ing a potential mechanism by which this important group of 
viruses is able to control the expression of virus-derived proteins. 


MATERIALS AND METHODS 


Cells and viruses. Primary chicken kidney (CK) cells were prepared from 
2- to 3-week-old specific-pathogen-free (SPF) Rhode Island Red chickens. 
IBV Beau-R, Beau-CK, and all recombinant viruses were propagated and 
titrated on primary CK cells using BES [N,N-bis(2-hydroxyethyl)-2- 
aminoethanesulphonic acid] cell maintenance medium (30). Extracellu- 
lar virus was harvested 24 h postinfection (hpi) and titrated by plaque 
assay. IBV strains M41, CR88, D1466, H120, Italy-02, and QX were prop- 
agated in 10-day-old SPF Rhode Island Red embryonated hens’ eggs. Al- 
lantoic fluid was harvested at 24 h postinoculation and clarified by low- 
speed centrifugation. 

Construction of modified IBV cDNA plasmids. To delete the Beau- 
dette IR, the IBV Beau-R genome corresponding to the region of ORF 3c 
to the N gene was inserted into pGPTNEB193. The IR sequence, nucleo- 
tides (nt) 25192 to 25459 inclusive, was digested out using introduced 
restrictions sites NheI and KpnI, and an adapter of NheI-Smal-EcoRV- 
Mlul-KpnI was inserted. The modified region, which maintained the first 
three amino acids of the putative coding sequence as well as any potential 
unmodified upstream TRSs, was subcloned into pGPTNEB193 to yield 
pGPT-BeauRAIR. 

To modify the Beau-R TRS-L for generation of rIBVs BeauR-L-CTG 
AACAA and BeauR-L-CTTAACAT, the first 500 bp of the Beau-R ge- 
nome along with the preceding 550 bp of recombinant vaccinia virus 
(rVV) sequence was amplified by PCR from rVV-Beau-R/T7 and ligated 
into Pacl- and HindIII-digested pGPTNEB193 to yield plasmid pGPT- 
rVV/IBV. The 550-bp vaccinia virus sequence also contained the T7 pro- 
moter sequence immediately upstream of the 5’ end of the Beau-R ge- 
nome. To introduce mutations into the TRS-L of pGPT-rVV/IBV, the 
forward primer BeauR-L-CTGAACAA (5’-TAGATTTTTAACTGAAC 
AAAACGGA-3’) or BeauR-L-CTTAACAT (5'-TAGATTTTTAACTTAA 
CATAACGGAC-3’) was used together with the reverse primer BeauR- 
L-CTGAACAA = (5'-GTCCGTTTTGTTCAGTTAAAAATCTA-3’) — or 
BeauR-L-CTTAACAT (5’-GTCCGTTATGTTAAGTTAAAAATCTA-3’) 
to generate plasmids pGPT-L-CTGAACAA and pGPT-L-CTTAACAT. 

Plasmids pGPT-eGFPA4b and pGPT-IR-CTGAACAA, used to gener- 
ate rIBVs BeauR-eGFPA4b and BeauR-IR-CTGAACAA, respectively, 
were generated by GeneArt. Gene synthesis was carried out to incorporate 
either enhanced green fluorescent protein (eGFP) or mutations °°”°G > 
Cand*°°”4C — A into the region of Beau-R corresponding to the M gene 
to ORF 5a, followed by cloning of the synthesized genome segments into 
pGPTNEB193. 

Generation of recombinant vaccinia viruses containing modified 
IBV cDNA and recovery of infectious IBV. Modified regions of IBV 
cDNA within plasmids pGPT-BeauRAIR, pGPT-L-CTGAACAA, pGPT- 
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L-CTTAACAT, pGPT-eGFPA4b, and pGPT-IR-CTGAACAA were intro- 
duced into the IBV Beaudette full-length cDNA within the vaccinia virus 
genome by homologous recombination using transient dominant selec- 
tion, as described previously (29, 30). Infectious rIBVs were recovered 
from recombinant vaccinia viruses containing the correctly modified IBV 
cDNAs and passaged three times on CK cells prior to experimental use. 

Leader-body junction analysis. Intracellular RNA was _ reverse 
transcribed using SuperScript II (Invitrogen) with the random primer 
5'-GTTTCCCAGTCACGATCNNNNNNNNNNNNNNN-3’. Resulting 
cDNAs were amplified by PCR with a forward primer located within the 5’ 
leader sequence (for IBV, 5'’-CTATTACACTAGCCTTGCGC-3’; for 
TCoV, 5'-CTATCATACTAGCCTTGTGC-3’) and the reverse primer 5’- 
GACCACATCCTACAACAACC-3’ located within the IR sequence. Tem- 
plates were denatured at 94°C for 2 min, followed by 25 cycles of 94°C for 
30 s, 50°C for 30 s, and 72°C for 30 s. PCR products were cloned using a 
TOPO TA Cloning Kit for Sequencing (Life Technologies). 

Northern blot analysis. Intracellular RNA was extracted from CK 
cells at 24 hpi using an RNeasy Mini Kit following the protocol for animal 
cells (Qiagen) with homogenization using a TissueLyser II for 30 s at 25 Hz 
(Qiagen). mRNA was purified using a Poly(A)Purist MAG Kit (Ambion), 
as per the manufacturer’s instructions. Northern blot analysis was carried 
out with a NorthernMax-Gly Kit (Ambion). Briefly, viral mRNA tran- 
scripts were denatured in glyoxal loading dye at 50°C for 30 min, followed 
by separation on a 0.8% low-electroendoosmosis (LE) agarose gel. RNA 
was transferred to BrightStar-Plus positively charged nylon membrane 
(Ambion) using capillary action for 2 h, cross-linked by treatment with 
UV light using the auto-cross-link function on a Stratalinker UV Cross- 
linker (Stratagene), and prehybridized for 30 min with ULTRAhyb buffer 
at 42°C. Blots were probed with a cDNA probe specific to the 3’ end of IBV 
[forward primer, 5’-CAACAGCGCCCAAAGAAG-3’, within the N gene; 
reverse primer, 5’-GCTCTAACTCTATACTAGCCT-3’, directly preced- 
ing the poly(A) tail] and labeled using a BrightStar Psoralen-Biotin Non- 
isotopic Labeling Kit (Ambion). Blots were hybridized overnight at 42°C, 
followed by washing and development with a BrightStar BioDetect Kit. 

Growth kinetics of rIBVs. Individual wells of six-well plates of con- 
fluent CK cells were infected with 1 X 10° PFU of virus and incubated for 
1 h at 37°C in 5% CO,, after which cells were washed twice with phos- 
phate-buffered saline (PBS), and 2 ml of fresh 1X BES medium was 
added. Extracellular virus was harvested at 1, 8, 12, 24, 48, and 72 h and 
subsequently analyzed in triplicate by plaque assay on CK cells for progeny 
virus. 

IR protein identification. Antibody generation was carried out by 
Cambridge Research Biochemicals (United Kingdom). Briefly, a synthetic 
peptide corresponding to amino acids 68 to 84 (DNGKVYYEGKPIFQ 
KGC) of the IBV M41 IR protein was synthesized and used to immunize 
two rabbits. Crude antisera were pooled and purified by affinity chroma- 
tography on thiopropyl-Sepharose 6B coupled with antigen. Samples of 
M41-infected CK cell lysates were separated on 10% bis-Tris precast poly- 
acrylamide gels (Life Technologies). Separated proteins were transferred 
to nitrocellulose membrane (Amersham Hybond ECL) and probed with 
the purified IR antibody. Potentially bound antibody was visualized using 
an enhanced-chemiluminescence detection system (ECL) (Millipore). 

Real-time PCR. RNA was harvested from CK cells at 24 hpi as de- 
scribed previously and reverse transcribed using 0.5 wg of RNA per reac- 
tion. The following cDNAs were synthesized at 48°C for 0.5 h followed by 
inactivation of the reverse transcriptase at 95°C for 5 min with TaqMan 
Reverse Transcription Reagents (ABI) and 2 pmol of reverse primer (for 
288, 5’-GACGACCGATTTGCACGTC-3’; IBV gene 3, 5’-TGGGACTTT 
GGATCATCAAACA-3’; (IBV IR, 5’-GCATAGACAAACGTAGCAAAC 
CTTT-3’). The synthesized cDNAs were used as templates for specific 
PCR amplification with the reverse primer as above and a forward primer 
for 28S (5'-GGGGAAGCCAGAGGAAACT-3’) or IBV cDNAs (5'-CTA 
GCCTTGCGCTAGATTTTTAACT-3’) and a probe for 28S (6-FAM-AG 
GACCGCTACGGACCTCCACCA-TAM) IBV gene 3 (6-FAM-CAATAC 
AGACCTAAAAAGT-MGB), or IBV IR (6-FAM-ACAAAGCGGAAATA 
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FIG 1 Characterization of an rIBV with the IR deleted. The IR of Beau-R was deleted, resulting in rIBV Beau-RAIR. (A) Schematic diagram showing the genome 
organizations of Beau-R and Beau-RAIR. BeauRAIR contains a 268-nt deletion, downstream of amino acid 3, reducing the IR sequence of Beau-R to 37 nt in length. (B) 
Growth kinetics of Beau-RAIR. Monolayers of CK cells were infected with 1 X 10° PFU of Beau-R or BeauRAIR, and extracellular virus was harvested at 1, 8, 12, 24, 48, 
and 72 h. Virus was titrated in triplicate by plaque assay on CK cells. Error bars represent standard deviations of the mean results from three independent experiments. 
(C) Northern blot analysis of the ssmRNAs produced by BeauRAIR. Lane 1, Beau-R; lane 2, BeauRAIR. The asterisk indicates an additional sgmRNA derived from within 
the S gene. L represents the leader sequence at the 5’ end of the IBV genome, E the envelope protein gene, and N the nucleoprotein gene. 


A-MGB) (where FAM is 6-carboxyfluorescein, TAM is 6-carboxy- 
tetramethylrhodamine, and MGB is minor groove binder). All primers 
and probes were designed using Primer Express software. Each 10-l real- 
time PCR mixture consisted of 5 wl of TaqMan Fast Universal PCR Master 
Mix (2X), 0.2 pl ofa 1 wM primer mix, 0.25 pl of probe (5 wM), 2.05 pl 
of water, and 2.5 pl of cDNA. All reactions were performed in triplicate in 
a 7500FAST TaqMan machine (Applied Biosystems) at 95°C for 20 s, 
followed by 40 cycles of 95°C for 3 s and 60°C for 30 s. Amplification plots 
were analyzed using Applied Biosystems Sequence Detection Software, 
version 1.3.1.21. 


RESULTS 


Identification of an mRNA corresponding to the IBV intergenic 
region. Northern blot analysis of the IBV mRNA profile has iden- 
tified a low-abundance RNA of unknown origin located between 
the sgmRNAs representing the M gene and gene 5. Data obtained 
while working on a number of rIBVs led to the suggestion of a link 
between the IR ORF and this low-abundance RNA species. To 
investigate this association in more depth, we used our IBV reverse 
genetics system to construct rIBV BeauRAIR, a recombinant virus 
based on IBV Beau-R, a molecular clone of the Beaudette-CK 
strain of IBV (28). Recombinant IBV BeauRAIR contains a 268-nt 
deletion within the IR beginning at amino acid 4 of the putative 
coding sequence (Fig. 1A). The deleted region did not disrupt 
sequences required for transcription of upstream and down- 
stream genes and was designed to ensure that any potential up- 
stream regulatory sequences that would be required for transcrip- 
tion of an sgmRNA for the IR were maintained. Growth analysis of 
this virus in primary chicken kidney (CK) cells showed that it 
displayed kinetics and peak titers equivalent to those of wild-type 
Beau-R (Fig. 1B). Northern blot analysis of the RNA species pro- 
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duced by rIBV BeauRAIR showed a reduction in size of the low- 
abundance RNA species in question and in the other larger 
sgmRNAs, consistent with the size of the deletion introduced into 
the Beau-R IR, providing a strong indication that this RNA is an 
sgmRNA corresponding to the IR of IBV Beaudette (Fig. 1C). 

As no specific TRS has ever been identified upstream of the IR 
ORF, the question of how this potential sgmRNA is transcribed 
was addressed. Intracellular RNA was extracted from CK cells in- 
fected with a variety of field and vaccine strains of IBV, and 
sgmRNAs were amplified by leader-body junction reverse tran- 
scription-PCR (RT-PCR), as described previously (28). An 
sgmRNA corresponding to the IR was identified as a PCR product 
of approximately 400 bp (350 bp for Beau-R and parent virus 
Beau-CK) (Fig. 2A). Sequence analysis of the RT-PCR products 
confirmed that each virus produced an sgmRNA corresponding to 
the IR as each RT-PCR product contained the IBV leader sequence 
followed by the canonical IBV TRS, CUUAACAA, as expected for 
all sgmRNA transcripts (Fig. 2B). However, sequence alignments 
showed that only positions 6 to 8, CAA, of the IBV TRS and not the 
entire consensus sequence had perfect homology to the IBV 
genomic sequence upstream of the IR. This CAA sequence is lo- 
cated approximately 100 nt upstream of the proposed AUG for the 
IR ORF, a distance consistent with distances observed for the cur- 
rently known IBV TRS-Bs and associated genes. These results con- 
firmed that all the IBV strains analyzed produce an sgmRNA cor- 
responding to the IR and suggested that transcription of this new 
sgmRNA species of IBV is initiated from a noncanonical TRS-B 
where the consensus 8-nucleotide TRS of CUUAACAA is reduced 
to CAA. 

The presence and sequence of the IR-derived sgmRNA were 
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FIG 2 Analysis of leader-body TRS junctions of IR sgmRNAs of IBV and TCoV 


isolates. Intracellular RNA was extracted from CK cells 24 hpi with IBV or from 


turkey embryos inoculated with TCoV strain FRO80385d. Virus mRNAs were RT-PCR amplified with primers specific to leader sequence and the IR for both IBV 
and TCoV. (A and C) Agarose gel electrophoresis of RT-PCR products amplified from IBV or TCoV mRNAs, respectively. Lanes L, molecular size ladder. (B and 
D) Sequence analysis of the IR mRNA transcripts from IBV-infected CK cells or TCoV-inoculated turkey embryos, respectively. The virus genome sequence is 


shown below, and the mRNA sequence is shown above. The leader-body TRS 


junction is marked in bold, the unaligned genome sequence upstream of the 


leader-body junction is underlined, and the initiation codon ATG of the 11-kDa ORF encoded in the IR is shown at the 3’ end. The relative nucleotide position 
of the ATG from the end of the IR mRNA (top sequence) or from the 5’ end of the Beaudette genome (bottom sequence) is given. 


also confirmed using RNA extracted from the tracheas of birds 
experimentally infected with the M41 strain of IBV, thus ruling 
out the possibility that this sgmRNA is an artifact of cell culture 
infection (data not shown). Given the identification of the 11-kDa 
ORF sequence in the closely related gammacoronavirus TCoV, a 
BLASTn search was carried out on a 50-nt region covering the 
proposed IBV IR TRS site and was found to have 98% homology 
with several TCoV sequences (data not shown), suggesting a pos- 
sible conserved mechanism of transcription. Analysis of RNA ex- 
tracted from turkey embryos experimentally infected with TCoV 
strain FRO80385d confirmed that an sgmRNA corresponding to 
the TCoV IR ORF is also transcribed using a similar noncanonical 
TRS-B to IBV (Fig. 2C and D). Further investigation of available 
Gammacoronavirus genome sequences revealed that viruses iso- 
lated from at least three other avian species also contain both the 
putative coding region and the conserved region of the nonca- 
nonical TRS: duck coronavirus (JF705860), partridge coronavirus 
(AY646283), and pheasant coronavirus ph/UK/602/95 (D. Ca- 
vanagh and P. Britton, unpublished data). 

The IBV sgmRNA transcribed via the noncanonical TRS can 
be translated. To date, the 11-kDa 94-amino-acid protein se- 
quence encoded by the IR-associated ORF has only been discussed 
with regard to its potential to be translated due to the lack of proof 
of an sgmRNA. Confirmation that an IR sgmRNA is produced 
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during an IBV infection suggested that the IR-associated 11-kDa 
protein is translated but has yet to be identified. To begin to ad- 
dress this issue, a synthetic peptide corresponding to the IBV M41 
IR sequence was generated, and antibodies were raised against this 
peptide. However, Western blot analysis of IBV M41-infected CK 
cells proved inconclusive in identifying the 11-kDa protein when 
cells were probed with this IR antibody. 

To strengthen the case for the existence of a fifth accessory 
protein of IBV, an alternative method utilizing our IBV reverse 
genetics system was pursued. An rIBV, BeauR-eGFPA4b, was en- 
gineered that replaced the IR ORF with that of enhanced green 
fluorescent protein (eGFP) so that the AUG of the 11-kDa ORF 
became the initiation codon for eGFP (Fig. 3A). Following infec- 
tion of CK cells with rIBV BeauR-eGFPA4b, fluorescence could be 
observed in infected cells (Fig. 3B). The eGFP sgmRNA showed 
the same TRS usage as that of the IR and confirmed that a protein 
can be translated from this newly confirmed IBV sgmRNA tran- 
scribed using the proposed noncanonical TRS-B (Fig. 3C). 

Determination of the origin of the TRS in the IR sgmRNA. In 
accordance with the mechanism of coronavirus sgmRNA tran- 
scription, we hypothesized that leader-body TRS base pairing for 
the de novo synthesized IR sgmRNA was centered on the CAA 
sequence of the noncanonical TRS-B and the complementary 
nucleotides of the TRS-L. This would suggest, therefore, that nu- 
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FIG 3 Translation of eGFP encoded within the IR mRNA. The putative coding region of the Beau-R IR was replaced with eGFP, generating rIBV BeauR- 
eGFPA4b. (A) Schematic diagram of the genome organization of BeauR eGFPA4b. (B) Fluorescence microscopy of CK cells infected with BeauR-eGFPA4b. 
Images were taken at 18 hpi at X10 magnification. (C) Sequence analysis of the eGFP mRNA transcript expressed by BeauR-eGFPA4b in CK cells. The virus 
genome sequence is shown below, and the mRNA sequence is above. The leader-body TRS junction is in bold, unaligned genome sequence is underlined, and the 


IR ATG is shown at the 3’ end relative to the leader-body junction as in Fig. 2. 


cleotide positions 1 to 5 of the sgmRNA TRS are derived from the 
TRS-L, following a recombination event and template switch be- 
tween nucleotides 5 and 6 of the CUUAACAA TRS-L. To test this 
hypothesis, two point mutations were individually engineered 
into the TRS-L of Beau-R resulting in two rIBVs: (i) BeauR-L-CT 
GAACAA, which contained a U > G mutation at nucleotide po- 
sition 3, and (ii) BeauR-L-CTTAACAT, which contained an A > 
U mutation at nucleotide position 8 of the consensus TRS-L. Both 
rIBVs replicated efficiently in CK cells, achieving peak titers of 
1.9 X 10” PFU/ml and 1.8 X 10” PFU/ml, respectively, and dis- 
played similar growth kinetics as observed for wild-type Beau-R 
(Fig. 4A). 

The sequences of the leader-body TRS junctions of the IR 
sgmRNAs produced by the two rIBVs were investigated as de- 
scribed previously. Sequence analysis showed that the TRS of the 
IR sgmRNA produced by rIBV BeauR-L-CTTAACAT was CUUA 
ACAA, the same as for Beau-R. This result confirmed that nucle- 
otides 6 to 8, CAA, of the sgmRNA TRS are derived from the 
truncated TRS-B associated with the generation of an sgmRNA for 
the IR. However, the TRS of the IR sgmRNA produced by rIBV 
BeauR-L-CTGAACAA showed incorporation of the U > G mu- 
tation at nucleotide position 3, demonstrating the involvement of 
nucleotides 1 to 5 from the TRS-L in forming the TRS of the IR 
sgmRNA (Fig. 4B). The incorporation of the nucleotide change at 
position 3, but not at position 8, is consistent with our hypothesis 
that the template switch during synthesis of the IR sgmRNA oc- 
curs upstream of the truncated CAA TRS-B and suggests a recom- 
bination point between nucleotides 5 and 6 of the conserved CU 
UAACAA TRS-L, a finding that demonstrates that in this case the 
full-length, 8-nucleotide TRS-B of IBV is not required to initiate 
template switching. 

The noncanonical TRS-B is responsible for reduced sgmRNA 
levels. Northern blot analysis of the RNA species produced by 
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FIG 4 Characterization of rIBVs containing point mutations within the 
TRS-L or TRS-B of Beau-R. (A) Growth kinetics of the rIBVs. Monolayers of 
CK cells were infected with 1 X 10° PFU of each virus, and extracellular virus 
was harvested at 1, 8, 12, 24, 48, and 72 h. Virus was titrated in triplicate by 
plaque assay on CK cells. Error bars represent standard deviations of the mean 
results from three independent experiments. (B) Sequence analysis of the IR 
mRNA transcripts expressed in CK cells infected with rIBVs BeauR-L-CTGA 
ACAA or BeauR-L-CTTAACAT. The virus genome sequence is shown at the 
bottom, the sequence of the IR mRNA from Beau-R is shown in the middle, 
and the sequences of the IR mRNAs from the rIBVs are shown at the top. 
Leader-body TRS junctions are in bold, unaligned genome sequence is under- 
lined, and the IR ATG is shown at the 3’ end relative to the leader-body 
junction as in Fig. 2. Arrows indicate sites of point mutations in the Beau-R 
TRS-L. 
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FIG 5 Analysis of IR sgmRNA levels expressed in rIBVs with TRS mutations. Intracellular RNA was harvested from CK cells at 24 hpi. (A, B, and C) Northern 
blot analysis of the mRNAs expressed by the rIBVs. Lanes 1, Beau-R; lanes 2, r[BV BeauR-L-CTGAACAA, BeauR-L-CTTAACAT, or BeauR-IR-CTGAACAA, 
respectively. (D) Quantification of the IR mRNA levels by qRT-PCR using 28S expression as an internal standard. Values were additionally normalized to gene 
3 sgmRNA levels to account for variations in infection and replication rates. Error bars represent the standard deviations of the means of two independent 
experiments. (E) Sequence analysis of the BeauR-IR-CTGAACAA IR mRNA transcript expressed in CK cells. The virus genome sequence is shown below, the 
mRNA sequence is shown above. The leader-body TRS junction is in bold, unaligned genome sequence is underlined, and the IR ATG is shown at the 3’ end 


relative to the leader-body junction as in Fig. 2. 


rIBVs with a modified TRS-L showed, in comparison to Beau-R, a 
reduction in the IR sgmRNA level from rIBV BeauR-L-CTTA 
ACAT but not from rIBV BeauR-L-CTGAACAA (Fig. 5A and B). 
This is consistent with previous reports for other coronaviruses 
describing the disruption of leader-body TRS junctions of group- 
specific genes (31). Given the reduced level of the IR sgmRNA 
observed from rIBV BeauR-L-CTTAACAT and the low and vari- 
able levels observed from Beau-R, we hypothesized that introduc- 
tion of a complete consensus TRS-B would increase the hybrid- 
ization efficiency between TRS-B and TRS-L, resulting in an 
increase in the amount of IR sgmRNA produced. To test this hy- 
pothesis, two point mutations were engineered into the Beau-R 
genome, *°°”°G — C and *°”*C — A, resulting in an IR TRS-B 
with the sequence CUGAACAA, equivalent to the TRS-B for the S 
gene and gene 3 of IBV Beaudette (Table 1). A TRS-B with the 
sequence CUGAACAA was adopted because the introduction ofa 
modified TRS-B with the consensus sequence CUUAACAA 


TABLE 1 Sequence context of the TRS-B sequences in the IBV 
Beaudette genome 


IBV sgmRNA Sequence* 

Leader TTAACTTAACAAAACGGAC 
S AAAACTGAACAAAAGACAG 
Gene 3 GTAACTGAACAATACAGAC 
M AAAACTTAACAATCCGGAA 
IR ACTGGTGACCAAAGCGGAA 
Gene 5 AAAACTTAACAAATACGGA 
N CTTTCTTAACAAAGCAGGA 


* The consensus TRS is highlighted in bold. The flanking nucleotides, both 5’ and 3’ of 
the consensus TRS for generating each sgmRNA, are presented to show the nucleotides 
5’ and 3’ of the TRS that may hybridize with the corresponding flanking nucleotides of 
the TRS-L. Nucleotides 3’ to the IR TRS-B showing homology to TRS-L flanking 
nucleotides are underlined. 
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would have resulted in a premature stop within the essential M 
gene. The introduction of these nucleotide changes gave rise to 
two amino acid changes, G189A and D190E, respectively, within 
the M protein. An rIBV, BeauR-IR-CTGAACAA, containing the 
two mutations was produced, and sequence analysis confirmed 
that the TRS-B for generating the ssmRNA was modified as ex- 
pected (data not shown). 

The growth kinetics of rIBV BeauR-IR-CTGAACAA were 
found to be similar to that of Beau-R although peak titers at 24h 
were approximately 1 log), lower (Fig. 4A), likely due to the amino 
acid changes within the M protein. Northern blot analysis of the 
RNA species produced by rIBV BeauR-IR-CTGAACAA demon- 
strated an observable increase in the amount of the IR ssgmRNA to 
a level more comparable with upstream and downstream 
sgmRNA levels (Fig. 5C), with no apparent effect on the larger 
sgmRNA species. These results were paralleled by quantitative 
RT-PCR (qRT-PCR) analysis, which showed a 6-fold increase in 
the amount of the IRsgmRNA from rIBV BeauR-IR-CTGAACAA 
anda 0.1-fold change in the amount of the IR sgmRNA from rIBV 
BeauR-L-CTTAACAT compared to levels from Beau-R (Fig. 5D). 
As with Northern blot analysis (Fig. 5A), no change in IR level was 
observed for rIBV BeauR-L-CTGAACAA. This result was ex- 
pected based on the previous finding that nucleotides 1 to 5 of the 
TRS-L are not involved in hybridization with the IR TRS-B, and 
thus alterations to this sequence will not influence hybridization 
efficiency and, therefore, subsequent sgmRNA levels. 

Leader-body junction RT-PCR analysis of the IR sgmRNA 
from rIBV BeauR-IR-CTGAACAA confirmed the sequence of the 
leader-body junction TRS as CUGAACAA (Fig. 5E), indicating 
that the IR sgmRNA TRS was now wholly derived from the mod- 
ified, near-consensus TRS-B, as predicted by the model of coro- 
navirus sgmRNA synthesis. The stability of the modifications to 
the IR TRS was investigated by serial passage of rIBV BeauR-IR- 
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CTGAACAA on CK cells. The modified IR TRS-B sequence was 
found to be maintained within the Beau-R genome for up to at 
least eight passages. As has also been observed by others (20, 31), 
these results demonstrated not only that the TRS-B naturally reg- 
ulates sgmRNA levels but also that stable modifications can be 
made to the TRS-B that alter the level of transcription of the asso- 
ciated gene. 


DISCUSSION 


This work has for the first time identified and confirmed that an 
RNA species previously observed at low levels in IBV-infected cells 
and with no known origin is an IBV sgmRNA. This sgmRNA is 
capable of expressing an uncharacterized ORF identified in IBV 
(10), as well as in other closely related gammacoronaviruses such 
as TCoV (11, 13). Coronaviruses possess the largest RNA ge- 
nomes, sharing a similar gene order in which genes encoding 
structural and group-specific accessory proteins are clustered at 
the 3’ end of the genome. Together with the related arteriviruses, 
coronaviruses have evolved a distinct mechanism for expression 
of the structural and accessory proteins involving translation from 
a3’ and5’ coterminal nested set of sgmRNAs. These ssmRNAs are 
copied from a series of negative-sense RNA templates synthesized 
from the genomic RNA by a mechanism involving a discontinu- 
ous step that adds the common 5’ leader sequence, derived from 
the 5’ end of the genomic RNA, to each sgRNA (14, 16, 32). It has 
previously been shown that an important factor regulating the 
process of generating negative-sense RNA templates is the TRS 
(18-20, 25, 33). A TRS consists of a short AU-rich sequence lo- 
cated at both the 5’ leader sequence (TRS-L) and upstream of each 
structural or group-specific gene (TRS-B) and is known to facili- 
tate the template switch required to complete the synthesis of each 
negative-sense copy of an sgmRNA. 

In studying the origin of a potential ssmRNA for the transla- 
tion of the IR ORF of IBV and TCoV (also called ORF 4b or ORF 
X), we have identified the existence of a naturally occurring 
sgmRNA for a coronavirus that contains a consensus TRS derived 
predominantly from the TRS-L (Fig. 4B), indicating that the tem- 
plate switch has occurred at a position within the conserved se- 
quence of the TRS-L. The TRS-B for the IR sgmRNA was found to 
consist of only three nucleotides, CAA, corresponding to posi- 
tions 6 to 8 of the highly conserved IBV CUUAACAA TRS, a 
finding that begins to explain the previous lack of identification of 
a TRS-B associated with the IR ORF and the resulting uncertainty 
over the function of this genome region (Fig. 2B and D). We have 
demonstrated that the newly confirmed IBV IR sgmRNA was gen- 
erated using an identical noncanonical TRS-B for a variety of vac- 
cine and field isolates of IBV and was also detected in tracheal cells 
isolated from IBV-infected chickens. Complete genome sequenc- 
ing of the closely related gammacoronavirus TCoV had revealed 
the presence of a sequence resembling the IBV IR for which the 
authors suggested a potential TRS-B, GUCAACAA, located 288 nt 
upstream of the initiation codon (13). We have demonstrated, 
using TCoV RNA isolated from infected turkey embryos, that an 
IR sgmRNA is produced from an identical position to that iden- 
tified in IBV, approximately 100 nt upstream of the initiation 
codon and utilizing a TRS-B of CAA as identified for IBV. 

Although the synthesis of ssmRNAs from noncanonical TRSs 
has been reported previously (34, 35), in this case the noncanoni- 
cal TRS is a truncation of the consensus CUUAACAA TRS, and 
the leader-body fusion site contains the canonical TRS, CUUAA 
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CAA, as expected based on other IBV sgmRNAs. The introduction 
of point mutations into the TRS-L demonstrated that nucleotides 
1 to 5 of the leader-body fusion site were derived from the TRS-L, 
with nucleotides 6 to 8 derived from the TRS-B, indicating that the 
template switch for the IR ssmRNA was occurring upstream of 
the noncanonical CAA TRS-B and between nucleotides 5 and 6 of 
the TRS-L. Although the mechanisms for controlling the template 
switching event during coronavirus and arterivirus discontinuous 
transcription are not yet fully understood, there is increasing evi- 
dence for the requirement of additional sequence elements that in 
some way control the process, whether by acting at the sequence 
level or through formation of specific RNA structures (20-24, 26, 
36), as well as the potential for RNA-protein interactions (17, 
37-40). In particular the 5’ and 3’ sequences flanking the TRS 
have been shown to be important, with previous reports for trans- 
missible gastroenteritis virus (TGEV) demonstrating that the four 
nucleotides either side of the core sequence of CUAAAC were 
important for predicting mRNAs at noncanonical junction sites 
(25, 26). 

As the sequence of the IR sgmRNA noncanonical TRS, CAA, 
occurs approximately 280 times throughout the Beau-R genome 
without generating sgmRNAs, it is likely that flanking sequences 
have an important role to play in initiating recognition of the IR 
TRS-B and the subsequent template switch. Analysis of the 
Beau-R genome shows regions of homology, both 5’ and 3’ to the 
noncanonical TRS-B, with the TRS-L and flanking sequences (Ta- 
ble 1); specifically, 5 of 7 of the IR 3’ flanking nucleotides (Table 1, 
underlined) are complementary to the leader sequence and are 
likely involved in establishing this sequence as a site for a template 
switch and sgmRNA synthesis. 

We have demonstrated, by producing rIBVs with specific mod- 
ifications to either the TRS-L or TRS-B, that the sequence of a TRS 
is directly responsible for regulating sgmRNA levels, as also pre- 
viously suggested (20) (Fig. 5). Recombinant virus BeauR-IR-CT 
GAACAA, with the introduction of a complete TRS-B for the IR 
matching the IBV S and gene 3 TRS-Bs of CUGAACAA, resulted in 
an increase in IR sgmRNA levels to levels more consistent with 
those of upstream and downstream genes. In contrast, rIBV 
BeauR-L-CTTAACAT showed a decrease in IR sgmRNA levels as 
a result of possessing only two nucleotide matches between the 
TRS-L and TRS-B at positions 6 and 7. The fact that IR ssmRNAs 
could still be detected by standard and real-time PCR with only 
two matching nucleotides further strengthens the argument that 
homology in flanking sequences is required for recognition of a 
TRS-B and the subsequent pause/stop of the replication complex. 
Analysis of the ssmRNA species produced by the TRS-L mutants 
did not reveal the presence of additional ssmRNAs generated by 
activation of cryptic TRS sites, as has been observed previously 
(31, 33). This was particularly interesting to note during the anal- 
ysis of rIBV BeauR-L-CTTAACAT. Although an exact CUUA 
ACAU sequence is found in the Beau-R genome beginning at nu- 
cleotide 24685, within the M gene, no sgmRNA was transcribed 
via this potential TRS, demonstrating, as has been shown previ- 
ously (23), that the conserved TRS is not sufficient for sgmRNA 
synthesis. Further work is needed to investigate the effects of in- 
creasing/decreasing the homology of 5’ and 3’ flanking sequences 
for IBV TRSs and to establish which nucleotides are essential for 
optimal TRS recognition and template switching. 

Northern blot analysis of rIBV BeauRAIR also revealed the 
presence of an additional RNA species slightly smaller than the S 
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sgmRNA and indicated by an asterisk in Fig. 1C. Investigation of 
this RNA revealed that it, too, was an sgmRNA with a leader-body 
junction of CUUAACAA and a noncanonical TRS-B, in this in- 
stance encompassing nucleotides 4 to 8, AACAA, of the consensus 
IBV TRS. This additional sgmRNA has a potential ORF in frame 
with the S gene sequence and encompassing amino acids 321 to 
1162 of the full-length protein. However, it was not possible to 
detect this ssmRNA from any IBV strains other than Beau-R and 
the parent virus Beau-CK, suggesting that it is an artifact of the 
repeated passage of Beaudette on CK cells to generate the 
Beau-CK virus. It is unclear if this ssmRNA has any function dur- 
ing Beau-R/Beau-CK infection, but its presence provides evidence 
that the use of noncanonical TRS-Bs, in the form of shortened 
TRSs, may be more prevalent than suspected. A similar, truncated 
S protein has previously been reported for severe acute respiratory 
syndrome (SARS)-CoV that could be translated from a novel 
sgmRNA also generated via a noncanonical TRS (35). 

Recombinant IBV BeauR-IR-CTGAACAA was not able to rep- 
licate to the peak levels observed for parental virus Beau-R in cell 
culture (Fig. 4A). The amino acid changes that arose within the M 
protein as a result of the point mutations introduced to create the 
CUGAACAA TRS-B are proposed as the reason for this. We sug- 
gest that the IR TRS-B could be viewed in terms ofa series of point 
mutations that have arisen within the IR TRS-B in order to evolve 
a more viable M protein, thus resulting in a virus with improved 
replication. While the extended TRS-B was maintained by the 
virus, by passage 8 additional nucleotide mutations had arisen that 
resulted in a further amino acid change of A189T, suggesting the 
acquisition of compensatory mutations to counteract the loss of 
viability of M. 

An alternative explanation for the nature of the IR TRS-B is 
that the noncanonical, truncated TRS-B has evolved to control the 
expression of the potential 11-kDa ORF protein through the gen- 
eration of a low-abundance sgmRNA. Coronaviruses and, indeed, 
all viruses belonging to the Nidovirales order have evolved an ex- 
tensive range of mechanisms for controlling expression of their 
genes. These include —1 frameshifting to control the expression 
levels of the replicase proteins and the generation of a nested set of 
sgmRNAs for the expression of structural and accessory proteins, 
some of which require translation via leaky scanning or use of 
internal ribosome entry sequences. 

The presence of natural deletions or truncations within the 
11-kDa ORF gene of some IBV strains suggests a nonessential 
accessory role for the proposed protein. Previous work has shown 
that the IBV accessory proteins produced by genes 3 and 5 and 
accessory proteins of other coronaviruses can be deleted with no 
major impact on virus replication in cell culture (41-44). Here, we 
replaced the truncated 11-kDa ORF of Beau-R with eGFP and 
showed that the resultant rIBV expressed eGFP in cell culture (Fig. 
3B). This demonstrated the translation of a protein from the IR 
sgmRNA and implies the expression ofa fifth accessory protein by 
IBV. Attempts to identify the 11-kDa protein in cell culture using 
antibodies raised against the IBV M41 IR sequence have been un- 
successful. 

In conclusion, in this study we have demonstrated the link 
between a previously uncharacterized RNA species of IBV and an 
ORE located between the M gene and gene 5. The identification 
and confirmation of a new sgmRNA for IBV and TCoV, tran- 
scribed via a noncanonical, shortened version of the consensus 
IBV/TCoV TRS-B, has revealed the possibility that template 
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switching can occur within the conserved TRS-L in the absence of 
a full-length complementary TRS-B. The use of noncanonical 
TRS-Bs for synthesis of sgmRNAs may offer a potential mecha- 
nism by which coronaviruses can expand their repertoire of pro- 
teins. We propose that the IR sgmRNA is the template for the 
translation of a novel 11-kDa accessory protein of IBV and closely 
related gammacoronaviruses, bringing the total number of group- 
specific accessory proteins to five. It now remains important to 
identify this protein and establish its role in gammacoronavirus 
infection. 
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